THE SHARP FORM OF THE STRONG SZEGO 
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BARRY SIMON 
In memoriam, Robert Brooks {1952-2002) 

Abstract. Let / be a function on the unit circle and Dn{f) be 
the determinant of the (n + 1) x (n + 1) matrix with elements 
{cj-i}o<i,j<n where = /,„ = / e"""^/(6i)|^. The sharp form 
of the strong Szego theorem says that for any real- valued L on the 
unit circle with L,e^ in L^i^), we have 



lim D^{e^)e-'^''+^^^° 



exp 



fc=i 



where the right side may be finite or infinite. We focus on two 
issues here: a new proof when e'^ — » L{d) is analytic and known 
simple arguments that go from the analytic case to the general case. 
We add background material to make this article self-contained. 



1. Introduction 



Let {cm}m=-oo be a two-sided sequence of complex numbers. A 
Toeplitz matrix is a finite matrix constant along diagonals: 



T 



n+1 



/ Co 

C-1 

Vc-n 



Cl 
Co 

-n+1 



C2 
Cl 



Cn \ 

Cn-1 

Co / 



It turns out that the natural way to label T is in terms of the Fourier 
transform of c, that is, 



imd 



;i.2) 



Date: January 21, 2004. 

^ Mathematics 253-37, California Institute of Technology, Pasadena, CA 91125. 
E-mail: bsimon@caltech.edu. 

2 Supported in part by NSF grant DMS-0140592. 

To appear in Proc. Conf. on Geometry and Spectral Theory. 

1 



2 



BARRY SIMON 



on dB (p = {z \ \z\ < 1}, (9© = {2 I \z\ = 1}). / is called the symbol 
of the Toeplitz matrix. 

One can define a symbol as a distribution so long as |cm| is poly- 
nomially bounded in m, but we will discuss the case where there is a 
signed measure, dfi, so that 

Cn = j e-''^' dfi{9) = ft„ (1.3) 

As usual, for / G L^(9D, |^), we define /„ to be the Fourier coefficients 
of the measure /|^. We will most often discuss the case where dfi is 
absolutely continuous, that is, dfx = w{9)^ and where w > or even 
that w = e^. 

Dn{dfi) is the determinant of T„+i. The strong Szego theorem says 
that if L, G with L real, then 

(in \ °° 
— ) ~ (n + l)Lo + l^^l' (1-4) 

fc=i 

There are a number of remarkable aspects of (jl.4|) . The first term 
was found in 1915 and the second in 1952. Despite the 37-year break, 
they were both found by Szego — the twenty- year old in 1915 and 
the 57-year old in 1952 j^! You might wonder about whether ()1.4|) 
is the leading term in a systematic 1/n series. In fact, if L is real- 
valued and if e*^ ^— L{6) is analytic in the neighborhood of dV>, then 
the error in ()1.4j) is 0(e~^") — there are no more terms in the series 
(this follows from (j2.21|) and (j5.17p below). Lest you be shocked by 
this, we note that for many models in statistical mechanics, the free 
energy has a volume term, a surface term, and then, if the interaction 
is short-range, exponentially small errors. 

A second remarkable aspect is the subtlety. Why should logw enter 
at all, and then in both linear and quadratic terms? There is a fascina- 
tion with this subject among mathematicians who have extended the 
result both in the context of function algebras |HZ1 113 122] and in the 
context of pseudodifferential operators on manifolds j2Hl HI] (see |2H] 
for literally dozens of papers on each aspect). 

A third aspect is that there are a remarkable number of applications 
of this result. Szego returned to find the second term because of a 
question raised by Onsager who ran into Toeplitz determinants in his 
work on the Ising model (see [23 H] for a discussion of this). They 
enter in the study of some Coulomb gases [201 |2I1 CD] and in electrical 
engineering applications [IHl E]- And they have had a surge of interest 
recently because of their role in random matrix theory |24j . 
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When Szego [221 proved (jl.4p . he assumed L was C^+^. There were 
many papers on this subject which improved this incrementally until 
Ibragimov ^^1; fifteen years later, proved the following sharp form: 

Theorem 1.1 L^t L he a real-valued function on (9D so that 

L,e^ e Li((9©, f ). Then 

hni^D„(^e^^^e-("+i)^» =exp(^^|A;||Lfep^ (1.5) 



Thus, p.4|) holds whenever the right side makes sense, that is, L in 
if^/^, the Sobolev space of order i. This should be supplemented with 
a result of Golinskii-Ibragimov [12]: 

Theorem 1.2. If d^i = + dfXs with dfXs singular is a positive 

measure on (9D and lim„_^oo -D„((i/i)e~^""'"^^^'' < oo, then dfis = 0. 

The combination of these two theorems has a spectral theory con- 
sequence that links it up to the theme of the conference and to Bob 
Brooks' interests. As we will see in Section |21 probability measures on 
dB> have associated parameters {a„((i/i)}5^Q called Verblunsky coeffi- 
cients. It can be shown using Theorems 11.11 11.21 and 12.41 that 

Theorem 1.3. Let dfi be a probability measure on d3 and {an{d^)}'^^Q 
its Verblunsky coefficients. Then the following are equivalent: 

(ii) dfis = and djj, = where Efe^i ^l-^fcP < oo. 

This is one of those gems of spectral theory that give necessary and 
sufficient conditions relating properties of a measure and its inverse 
spectral parameters. 

We have two main themes in this article. First, we wish to note 
that despite it taking fifteen years to go from L G C^^^ to L G if 
there is an elegant and simple argument to do this jump. This combines 
arguments of Golinskii-Ibragimov ^2] and Johansson ^Tj whose proofs 
have "easy" halves that handle opposite sides of the extension. It 
does not appear to be widely appreciated that their arguments can be 
combined in this way. In fact, these results reduce ()1.4p to the case 
where the decay exponentially, that is, e*^ (—> L{d) is real analytic 
on dB. 

Second, we have a new proof of ()1.4|) in this real analytic case that 
is perhaps less mysterious than the elaborate calculation in Szego 
From our point of view, the two terms in ()1.4|) come from two terms in 
the Christoffel-Darboux formula for z = e^^ . 
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While the arguments of ^21 1^ are simple, they depend on consid- 
erable general machinery relating Toeplitz determinants to orthogonal 
polynomials on the unit circle (OPUC) on the one hand, and to the sta- 
tistical mechanics of Coulomb gases on the other (both themes go back 
to Szego's early work: [22] has the Coulomb gas representation and the 
main point of jHOl EI] is to discuss the connection to OPUC), so this 
article, in attempting to be self-contained, provides this background. 

Sections 121 and El discuss the basics of OPUC. In SectionjH we get the 
leading term in (jl.4j) . not only for its own sake, but to define in Sectional 
the Szego function which will play a critical role in many aspects of 
the remainder. These preliminaries allow us to present the Golinskii- 
Ibragimov half of the extension in SectionlHl After proving the Coloumb 
gas representation in Section [71 we can prove the Johansson half of the 
extension in Section |H1 The final three sections provide the proof of 
()1.4j) in the analytic case: Section 121 has a preliminary proving the 
Christoffel-Darboux formula, and the last two sections finish the proof. 

I have written a comprehensive book on OPUC |2H1 and everything 
in this paper appears there, but it seemed sensible, given the fact that 
the material is spread through a long book, to pull out exactly what is 
needed to prove Ibragimov's theorem. 

While ()1.4j) is sharp in one sense, it is not the end of the story by 
any means. First, there is a simple argument of Johansson that 
drops the requirement of reality from L: if e^,L E and L G H^^'^, 
then an extension (jl.4|) holds in the sense that 



There are also subtleties in extending ()1.4|) to allow matrix- valued sym- 
bols, to allow complex w's with nonzero winding number, and to deter- 
mine the leading behavior when L ^ L^. The reader can consult |2S| 
for references on all these issues. 

Over the course of studying asymptotics of Toeplitz determinants, I 
have learned a lot in discussions with Percy Deift, Rowan Killip, and 
Irina Nenciu, and I would like to thank them for their insights. 

Bob Brooks was a substantial mathematician and wonderful person. 
We lost him too soon. I'm glad to dedicate this article to his memory. 
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2. Verblunsky Coefficients and Toeplitz Matrices 

If c„ are the moments of a measure, /x, it is natural to form the monic 
orthogonal polynomials, ^n{z;dfi), defined by 

= + lower order (2.1) 
{z^,^^)^ = ifj = 0,l,...,n-l (2.2) 

where 



if, 9) = J f{e'')9{endM (2.3) 

is the L^(c}D, d^) inner product. In order to form for all n, we need 
the measures dfi to be nontrivial, that is, not supported on a finite set 
of points. 

The matrix elements Ck-e = J e'-^^^^^^ dfi = so Dn^djj) is a 

Gram determinant. Such determinants allow change of basis, that is, if 
Pk{z) = + lower order, det {{Pk, Pe))o<k,e<n = Dn{dfi). We can take 
Pk = $fc, in which case, {Pk, Pe) is a diagonal matrix (!), and so. 

Theorem 2.1. 

n 

D4dfi) = l[\\<l>,\\% (2.4) 

j=0 

is the orthogonal projection in of onto the orthogonal com- 
plement of [1, ... , z^~^], the span of {1, ... , z^~^}. Since multiplication 
by z is unitary, z^j is the projection of z^^^ onto [z, . . . , z^]^ while 
^j+i is the projection of z^^^ onto [1, . . . , z^^, so 

||$,+i|| < = ||$,|| (2.5) 

Thus, ||$j|| is decreasing in j. It follows that 

Theorem 2.2. 

(a) hm D^idf^y/^^' = hm = lim ||$„f (2.6) 

(b) %i < #^ (2.7) 

These ideas all go back to Szego [201 IHI] • The next idea, which is a 
set of recursion relations of the $„, was first written down by Szego P^ . 
but the basic parameters occurred in a related context in Verblunsky 
j35t IHE] . To state them, we need to define the reversed polynomials 



K{z)=z^^^{llz) {21 
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Theorem 2.3. For any nontrivial measure, dfj,, there exists a sequence 
of numbers {a„}^o that 

= z$4z) (2.9) 

Moreover, G © and 

\\<!>n+lf = {I - lanl^mnf (2.10) 

and 

n— 1 

||<l>„f = co(rf/i) J](l- |a,f) (2.11) 

j=0 

Remarks. 1. In the next section, we wiU see that fi h-> {a„}J^Q is 
one-one for yu's which are normahzed. 

2. There is a converse going back to Verblunsky j^HI (see j2Hl for 
many other proofs) that the map from probabihty measures to X D 
hj fi^ {an}n=o is onto. 

3. The an are caUed the Verblunsky coefficients for d^. 

4. Applying Q*{z) = z"+^g(l/2) to ((231) yields 

K^.iz) = Ki^) - anz^niz) (2.12) 

5. The proof below is a variant of one of Atkinson fT^. Szego's proof 
first proves the Christoffel-Darboux formula (see Section and uses 
that to prove 

Proof. Let Vng = z^g on L'^{dfi). Vn is anti-unitary, maps Vn, the poly- 
nomials of degree n, to themselves, and maps $„ to $* . Since $„ is the 
unique element (up to constants) of Vn orthogonal to {1,2;,..., z"~^} 
and Vn is anti-unitary, $* is the unique element of Vn orthogonal to 

{Vnl,...,VnZ^-'} = {z^,Z^-\...,z}. 

Now, for j = 1, . . . , n, 

{Z^,z<l>n) = {z^~\<l>n)=0 

and clearly, {z\^n^i) = 0. Thus, 

{Z^,<^n+l - Z^n) = 

for j = 1, . . . , n. Since and ^n+i are monic, $„+i — 2;$„ G Vn- So, 
by the first part of the proof, ()2.9p holds for a suitable constant an- 

Thus 

an = -$n+i(0) (2.13) 
Since $* ± ^n+i, we have 

||$„,f = ||2;<l>„f = ii^^+i + an*;!!^ 

= ||$„+if + |a„,n|$„||2 
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which imphes (j!2.1(Jj) . (j!2.1(Jj) in turn imphes < 1. (j2.1H) follows by 
induction and ||$oP = PIP = Co{dfi). □ 

This leads to 

Theorem 2.4. Suppose J d^ = l. We have 

^(*'%"?iw^^(l-la,^ (2,14) 

where the product always converges although the limit may he zero. 
IfF{dfx) > 0, then 

Gn = ^ (2.15) 

obeys 

Gn+l > Gn (2.16) 

The limit always exists {but may be infinite) and is given by 

oo 

G{dfi) = lim Gnidfi) = n (1 - lajf)"^^^ (2-17) 

j=0 

Remarks. 1. In particular, 

oo 

F>0^^|a„p<oo (2.18) 

n=0 

G < OO ^ ^(n + l)|a„p < OO (2.19) 

n=0 

2. (j2.14p in a sense goes back to Verblunsky jSH]- (|2.17p seems to 
have only been noted by Baxter many years later. 

3. If Co 7^ 1, F^djj) = Co njlo(-'- ~ I'^iP) while G{dfj,) is still given by 
(ETfll . Indeed, G„(rf/i) = Gn{d^i/ J d^). 

4. ()2.7|) says logD^ is concave in n. The monotonicity of Gn is a 
standard fact about concave functions with finite asymptotics. 

Proof. By and then (ITTT|l . 

Dn+l 



D. 



n 



|<l>„+i|| = J](l-|«,f) (2.20) 

j=0 

(if = Co = 1). From this and \aj\ < 1, ()2.14j) is immediate. If F 

is nonzero, 

f^i.= TT (l-|a,f)-i 

" j=n+l 
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SO that 



F 



k=l >- 

oo 



Dk-1 F 



= Y[{l-\a/y"'"^"''^~^ (2.21) 
i=o 

from which Gn+i > G„ and ()2.17|1 are immediate. □ 

3. Bernstein-Szego Approximations 

Given a nontrivial probabihty measure, dfi, with Verblunsky coeffi- 
cients {c(n}'^=o, we will identify the measures, dfx^^\ with 

= l"^-^^^) j = 0,l,...,iV-l 

and see d/i^'^'' djj, weakly. In many ways, the general proof of the 
strong Szego theorem will play off this approximation and the distinct 
approximation obtained by truncating the Fourier series for L in e^^. 
As a preliminary, we need 

Theorem 3.1. ^n{z) has all its zeros in D. ^ni^) ^^■^ zeros in 

C\D. 

Remark. This proof is due to Landau JH]- See [2H1 for many other 
proofs of this fact. 

Proof. Let $n(-2o) = 0. Then 7r„_i = ^n{z)/{z — zq) is a polynomial of 
degree n — 1, and so (7r„_i, $„) = 0. Since {z — zo)7in-i = $n, 

Ikn-ir = Ik7r„_i||^ = ||2;o7r„_i + <l>„||^ 
= |2;on|7r„_if + \\<l>nf 

or 

(l-l^oHlkn-lf =||<fnf (3.2) 

from which we conclude \zo\ < 1, that is, the zeros of lie in ©. Since 
$;(zo) = if and only if $„(l/^o) = 0, the zeros of lie in C\D. □ 

Given /i, define a measure, p.^^\ by 

which can be defined since $7v(e*^) 7^ for all 6 by Theorem 13.11 We 
have 
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Lemma 3.2. For j > 0, 

^N+jiz; dfi^^'^) = z^<^Niz; d^i) (3.4) 



= (3.5) 



Moreover, 

for £>N. 
Proof. Since 

|$;v(e^')|' = e-^^^$;v(e^')$^(e*') (3.6) 
we have for k^Z, k<N — 1 (includes A; < 0), 



= 

since — A; — 1 > and $^(2;)^^ is analytic in a neighborhood of 
by Theorem 13.11 This says 



^^^$^)^(iv) =0 (3.7) 



for £ = 0, 1, . . . , + j — 1. Since z^^^ is monic, ()3.4|) holds. 

By ^rn^ . if $fe+i(0) = 0, then = 0, so (|S3I) implies 
$Ar+j(0; djl^^^) = 0, which in turn implies ()3.5|) . □ 

Theorem 3.3 (Geronimus ^l]). -^et dfi, dv he two nontrivial measures 
on Suppose that for some fixed N, 

^n{z] d/i) = ^n{,z] dp) (3.8) 

Then 



^j{z; dfi) 


= ^j{z] dv) 


J 


= 0,1,., 


..,N-1 


(3.9) 


aj{djj,) = 


aj{dv) 


3 


= 0,1,., 


..,iV-l 


(3.10) 


Cj {djj) 


Cj{dv) 


J 


= 0,1,., 


..,N 


(3.11) 


Co{dn) 


coidu) 




Coidii)co{duy^\\^j\\l^ 


j 


= 0,1,., 


..,N 


(3.12) 



Proof. fj2.9p and ()2.12j) can be written in matrix form 

'^j+ii^A _( z -"A ( ^liz) 



(3.13) 



The 2x2 matrix in (j3.13j) has an inverse z ^Pj^ { a]z °z ) where 

p. = (i-i«.r)'/' (3.14) 
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Thus ()3.14j) implies the inverse Szego recursions: 

= /'7'ft"+i(^) + (3-16) 

()H.8j] implies 



a]\f{dfi) = —$Ar(0; (i/i) = — $Ar(0;(iz/) = a^idiy) 

and thus, by (ITTHIl with j = - 1, we have for j = iV - 1. By 
iterating this argument, we conclude that (j3.9p and (j3.1(J|) hold. 

We need only prove ()3.1H) and ()3.12j) . assuming co{dfi) = coi^du) = 
1 since with dp, = dfj,/co{d^), we have Cn^djj) = co{dfi)cn{dp,) and 
||$^.||2^ = co(rf/i)||$j||i^. (HTT^ is immediate from (ETTll . 

We prove (j3.11|) when Co{d^) = Co^du) by induction and noting 
= yields a formula for Cfc in terms of the coefficients of 
and Co, ci, . . . , Cfc_i. □ 

Remark. The last paragraph of the proof shows that the a's deter- 
mine the c's and proves that the map of /i to a is one-one. 

To succinctly state this section's final result, we introduce 

fN{z; dfi) = (3.17) 
W^nW 

the orthonormal polynomials. By ()2.1H) . 

N-l 

(Pn{z\ djj) = Yl pJ^Co{dny^^^^N{z; dp) (3.18) 



3=0 



We define 



71—1 \ 

\{pA{co{dp)V = \\^nV (3.19) 

j=0 ^ 



and 



/too = lim (3.20) 

n^oo 



which exists (but it may be +oo) by < 1. The infinite product in 
()3.19|) . and so Kqo, is finite if and only if 



oo 



/too < oo <^=^ Ictjl^ < OO (3-21) 

We also note the translation of ()2.9j) / ()2.12j) from $ to ^9: 

<^n+i(2:) = p~^{z^n{z) - an^l{z)) (3.22) 
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•^l+liz) = Pn\'^*niz) - OinZ<^n{z)) (3.23) 

Theorem 3.4. Let dfi be a nontrivial probability measure on 83. De- 
fine 

d^W = , (3.24) 

Then dfx^^^ is a probability measure on 9© for which ()3.1|) holds. As 
N — > oo, djj.^^'^ — > dn weakly. 

Remark. We call measures of the form ()3.24|) BS measures and rf/i*^^) 
the BS approximation. 

Proof. Since is a multiple of dn^^\ we have the bottom half of 

(EH) by dnUl). Since (Q holds for j = 0, Theorem O and (nTTTll) 
imply the top half of p.ip . 

Since $7v = II^Afl^V^Af, we clearly have 

ll'^Af||^{iV) = ll'^Afll^ 

SO, by (j3.13|l . coidfj,^'^^) = co^d/j,) = 1, that is, dfiN is a probability 
measure. 

By the above and ()3.1H) . we have 

Cjidfx^^'^) = Cjidfj.) j = 0,l,...,N (3.25) 

This and its complex conjugate implies that for any Laurent polyno- 
mial, / (polynomial in z and z~^), 

lim [ f{e'')dfi^''^ = [ f{e'^)dfi (3.26) 

N^oo J J 

since the left side is equal to the right for large. Since Laurent 
polynomials are dense in C(9D), (j3.26j) holds for all /, that is, we have 
weak convergence. □ 



We note we have proven that 



a 



a,- 3 < N -1 



The ideas of this section go back to Geronimus jTTJ and were redis- 
covered in and [7]. In particular, the use of inverse recursion to 
prove Geronimus' theorem is taken from (7j. 
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4. SzEGo's Theorem 
In this section, our main goal is to prove 
Theorem 4.1. For any w E L^{^), 

lim ^\ogDNiw)= [\ogw{e)^ (4.1) 

Remarks. 1. Since log(x) < x — 1 and w{6) G L^, 
J max(0, log w(x))|^ < oo, so the integral on the right side of (jl.lj) 
is either convergent or diverges to — oo, in which case ()4.1|) says 

D^iwy/^ ^ 0. 

2. This was conjectured by Polya [211 proven by the twenty- 
year old Szego in 1915 [201 • Our proof here is essentially the one Szego 
presented in jSOl EI] • 

3. The same result is true for the symbol dfi = w{9)^ + dfig, that 
is, the limit is independent of dfi^- This extension was first proven by 
Verblunsky jSH]. We will not prove this more general result here (j2H] 
has four different proofs in Chapter 2) since it is peripheral to our main 
result. 

The first half of the theorem is a simple use of Jensen's inequality: 
Proposition 4.2. Let w = with w,L E . Then 



l*»eg>exp(/L(»)| 



(4.2) 



In particular, 



Dniw)>exp[ l{n + l)L{e)^\ (4.3) 



Proof. follows from and ^T^ . To prove we write 

/df) 
exp(21og|$:(e^^)|+L(e^^))- 

> exp(^| [21og|<|.:(e^^)| + L(e'^)]) ^ (4.4) 

by Jensen's inequality. 

By Theorem 13. H log($*(2;)) is analytic in ©, so 

/log|$:(e^^)||^ = Re|log($:(e^^))|^ 

=iog|$:(o)i = o 

since monic implies $*(0) = 1. □ 
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The other half of the theorem depends on a variational principle 
noted by Szego: 

Proposition 4.3. We have for any w E |^), 



hm [D^iw)]'/^ = inf ( / \f{e^')\MO) ^ 



/Gi/~(D); /(0) = 1 

(4.5) 

Remark. H°°{3) is the Hardy space of bounded analytic functions 
on D. By general principles |H1I2Z|, for |^ a.e. e*^ G 9D, lim.r|i /(re*^) 
exists, and that is what we mean by /(e*^) in ()4.5|) . 

Proof. Since ||$;|| = ||<l>„||, by (jZE)), 

LHS of ^ = lim 11$; II 2 

n— >oo 

= inf||<l>;f (4.6) 

n 

by (j2.5|) . By the argument at the start of the proof of Theorem 12.31 

$* = TT 1 

where 7r„ is the projection in the space of polynomials, Vn, of degree 
n onto the orthogonal component of the span of z, z"^ , z^, . . . , z"'. This 
span is {P G Vn \ -P(O) = 0}, so by standard geometry, 

llCf = inf{||Pf \PeVn, P{0) = 1} (4.7) 

proving again that ||$j!j|P is decreasing in n and proving ()4.5|) if H°° is 
replaced by the set of all polynomials. 

To complete the proof, we need only show that for any / G H°° with 
/(O) = 1, there are polynomials Pi{z) so that -P^(O) = 1 and 

me^')\MO)^^ [ me^')\MO)^ (4.8) 

ZTT J ZTT 

If / is analytic in a neighborhood of D, the Taylor approximations 
converge uniformly on D, so ()4.8|) holds. For general /, by the domi- 
nated convergence theorem, 

hm I \f{re^')\MO)^ = J \f{e'')\MO)^ 

so, by a two-step approximation, we find P^s so (j4.8j) holds. □ 

Proof of Theorem \4.1\ We will prove that for any e > 0, there is an / 
in with /(O) = 1 and 

/ \f{e'')\MO) f < exp(^| \og{w{e)+e) ^ (4.9) 



14 BARRY SIMON 

SO taking e I yields the opposite inequality to ()4.3|1 . 
Define 

g{z) = exp (- I \og{w{e) + a) (5^) ^) (4-10) 

and f{z) = g{z)/g{0), so /(O) = 1. Moreover, \g{z)\ < e^^l'^ by the 
fact that the Poisson kernel 

(pid _|_ i-f' 

is nonnegative and J ^P^iQ^i^) = 1. By standard maximal function 
arguments (see [211), = l^(^) "'^''^5 so 

\fie'')\MO) ^ < 9(0)-' = RHS of (Q 



□ 



5. The Szego Function 



When w{9) = e^^^^ with L G L^, Szego jHOlini] introduced a natural 
function, -0(2;) on ©, which will play a critical role in several places 
below: 



(5.1) 



Do not confuse Dn and D{z). Both symbols are standard, but the 
objects are very different. 

Theorem 5.1. (a) // (j2T8|l holds, then for \z\<l, 

D{z) = D{Q)e^JY.Lkz''\ (5.2) 
^ k=i ' 

^(0)= CoU|^ 11(1- Kl')'^' (5.3) 

\ /J n=n 



n=0 

(b) D(z) lies m H\]D)). 

(c) lim^ii -D(re*^) = D{e^^) exist for a.e. 6 and 

\D{e'')\^ = w{e) (5.4) 

(d) D is nonvanishing on D. 
Proof (a) We get and 

L>(0) = exp(iLo) (5.5) 
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from (|5.1|) and 



id 

l + 2^(e -z) 

j=0 



- z 



uniformly in e*^ G and z G {\z\ < r}. ()5.3p then follows from ()2.14|) . 
dHH), and (jK3|l . 

(b) Let D^^'^\z) be given by ()5.1|) with L{9) replaced by 
min(L(^),M). Then D^*^) G and |L)(^^)(e^^)|2 = min(w;(^), e^*^), 

sup r \D'-''\re'')\^ — = r ^m{w{e),e''') — 

0<r<l Jo 27r Jq 271 

< / w{e) - 

Jo 

Thus, taking M — oo, we see D G H'^. 

(c) is immediate from properties of boundary values of the Poisson 
integral in (jSHJ. 

(d) is trivial from □ 

The following simple but powerful calculation goes back to Szego 
[101 EI] (it has a version when dfig ^ 0; see [2H]): 

Theorem 5.2. Let dfi = where w = e^, L E L^. Then, as 
n —>■ oo, 



l^^:(e^')-lP^-0 (5.6) 

(ii) j \^:ie'') - D-\e^')\' d^^ ^ (5.7) 

(iii) ifUz) ^ Diz)-' (5.8) 
uniformly on compact subsets o/D. 

Proof, (i) By Q, /l/^^Pf = Jl^nl'^dfx = 1, so ^ is equivalent 
to 

Re I K{e^')D{e^') ^ - 1 (5.9) 

Dip"^ is in if^, so the Cauchy integral formula applies, that is, 

LHS of dnSI) = <(0)D(0) 

= ^n^^' (5.10) 

since <l>;(0) = !,(/?„ = and (Q and imply D(0) = k^. 

(j5ln|l implies (jEH)). 

(ii) is immediate from ()5.(ij) and dfj, = = |-Dp|f by ()5.4|1 . 
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(iii) ()5.fij) says Dip*^ — >■ 1 in and so, a fortiori, uniformly on com- 
pact subsets of ©. □ 

We need to extend this result to a neighborhood of D when L is real 
analytic. As a preliminary, we note: 

Lemma 5.3. Let dfx = be a probability measure with logw G 
L^(f ). Then 

an = -«:oo J ^n+i{e'') D{e'')-' d^x{e) (5.11) 
Proof. We will prove for m > n + 1 that 

«n = -Km j ^n+l{e'')^U(i'') d^l{e) (5.12) 

from which (|5.11|) follows from (|5.7|) . 

ip*^ is orthogonal to so if P is any polynomial of degree at 

most m with P(0) = 0, 

j P{^) pUe'') dm = ^ (5.13) 
()5.12|) follows from ()5.13|) by taking 

which has -P(O) = + Km(— «n) =0. □ 

The following is due to Nevai-Totik [23]; it is needed in Section ITTl 

Theorem 5.4. Suppose that dfx = and e*^ t— > L{9) is analytic in 
a neighborhood of 83. Then Pni^) ~^ ^{z)~^ uniformly in a neighbor- 
hood of 3. Moreover, the Verblunsky coefficients obey |a„| < C2e~^"'^'^ 
for some A > 0. 

Proof. Analyticity says < Ce^^^'^^ for some A > 0. So, by (j5.2p . 
D(z) is analytic and nonvanishing in a disk of radius e^. In particular, 
if 

oo 

D{z)-' = Y,d,,-iZ^ (5.14) 

j=0 

then 

\dj,-i\ < (5.15) 
Plug (IK?Ti|l into (IKTTT|l and note that 
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for /c = 0, 1, . . . , n. Thus 

oo „ 

\an\ <K^Y1 l^'^'-il / *n+i(e^^)e^''^ dfi 

k=n+l 

oo 

< /too||$n+l|| ^ 14 -l| 

k=n+l 

oo 

< /too ^ 14-1 1 

k=n+l 

since ||$n+i|| < 1- So, by dSHS)), 

By and |$;(e^^)| = |$„(e^^)|, we have 

sup|$* (e^^)|<(l + |a„|)sup|<l>:(e^^)| 
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(5.16) 



(5.17) 



j=0 



< 



C4 < oo 



(5.18) 



by iteration. C4 < 00 follows from ()5.17|) . Since $* is analytic, we get 

(5.19) 



sup|<l>:(;.)| <C4 



and thus, since ^n{z) = 2;" $„(l/2;), we get 

z G C\D |$„(2)| < C4|^r 

Returning to ^TT^ . 

00 00 

J2\K+ii^)-Kiz)\<Y.K\\Mz) 



(5.20) 



ra=0 



n=0 



< 



C4C*2 



2;e 



-A/2\n 



n=0 



showing $*, and so <y9*, converges uniformly in {z \ \z\ < e^^'^}. Since 
the limit is in D, it is in this larger disk. □ 

Finally, in terms of D, we want to rewrite the second term in the 
Szego asymptotic formula ()1.4|1 : 
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Theorem 5.5. Letdfi = e^^^^^ with L e . Let Lk he given by ()1.3|) . 
Then 

2 

(fz (5.21) 





dD 


/ \Diz)\-' 


dz 


J\z\<l 



where both sides can he infinite. 
Proof. fl5.21|) follows by taking r | 1 in 

k=l 



TC 





dD 


[ \Diz)r 


dz 


J\z\<r 



d^z 



(5.22) 



(by using monotone convergence). To prove ()5.22|) . note that by ()5.2|) . 

^D(zy 



log 



[DiO) 



k=l 



converges uniformly in \z\ < R and that \D\ = \-§^^ogD(z)\'^. 

Thus ()5.22|) follows from 

1 



z'^-^z'-^ d^z = k-'Sker 



and 



^ ■l\z\<r 

d 



dz 



log D{z) = J2kLkz''-' 



k=l 



□ 



6. Extending the Strong Szego Theorem, Part I 



With the Szego function and Fatou's lemma, we have the tools for 
the Golinskii-Ibragimov [12j half of the extension theorem: 

Theorem 6.1 ([12 ). Suppose for any BS measure, dfi, we know that 

G{dfi) =exp(^k\LkA (6.1) 
^ k=i ' 

Then for any measure dfi = e^^ with L E , we have 

G{dii)>exp(^k\Lk\A (6.2) 
^ fc=i ^ 

Remark. In particular, G < oo ^ ^l-^fcP < oo. 
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Proof. Let dfi^'^^ be the BS approximations to dfi. Let D^^\z) be the 
D function for dfi^'^^ and let 



.llzKl 



dz 



d'z 



(6.3) 



and similarly for M and D. 
By flTT7|l and (ITTfll . 



G{dfi 



N-l 

(^)^ = n (1 

i=o 



— a. 



so G{djj,^^^) is monotone increasing to G{dfx), that is, 

^(ci/x) = lim (6.4) 

On the other hand, since (fN+jiz, d/j,^^^) = z^LpN{z,dn^^^), 
yp^+^.(2;,rf/i(^) = y;^(2;,rf/iW) = ¥^^(z,rf/i), so by (ED, 



DW(z)=^^(z,d/i)-i 



and thus, by ()5.8j) again. 



(6.5) 



(6.6) 



uniformly on compacts. By analyticity, the same thing is true for 
derivatives, so 



\D(z) 



dD 



dz 



lim |Z}(^)(z)|-2 



N^oo 



dz 



Thus, by Fatou's lemma, 

M < liminf M(^) 

By hypothesis, 

^(rf/i^^)) = exp(M(^)) 
dniHI), (inill), and (D imply that 

G{dfi) > exp(M) 

which is (jFT^ . 



(6.7) 
(6.8) 



□ 
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7. The Coulomb Gas Representation 

In this section, we will provide an integral formula for Dn that will 
be critical in the next section. This formula appeared in Szego's first 
paper [211 asymptotics of Toeplitz determinants. He used it there 
for a minor technical purpose — and for us, too, it plays a relatively 
minor role. That said, it plays a central role in two proofs of the strong 
Szego theorem and several applications. 

While we will not pursue the Coulomb gas picture, if one uses \zj — 
Zk\ = exp(log|2;j — 2;fc|), the formula we give for _D„ says it is the partition 
function of a two-dimensional gas, and this point of view is the basis 
of Johansson's proof ^Tj. It also explains some interest in Toeplitz 
matrices in the physics literature [20, 2T| ITUj. 

Using Weyl's relation that Haar measure restricted to the classes of 
U(n+1), the group of nxn unitary matrices, is essentially (27r)~"'~-^[(n+ 
1)']"^ IIfe<j|e'^' - e'^' P d9o... d9n with {e'^'}]=Q the eigenvalues of U e 
U(n + 1), one can use the formula below to rewrite Dn as an integral 
over U(?T, + 1). This is both the starting point of the Bump-Diaconis ^ 
proof and of the many applications of Toeplitz matrices in the theory 
of random matrices [24j . 

It is, of course, well-known that nA;<j('^fc ~ ^i) ^ Vandermonde 
determinant. Expanding two such products, we get 

2 n 



0<k<j<n 



7r,(TeS„+i j=0 



where S„+i is the permutations of {0, ... , n} to itself. 

Setting Zj = e*^-'' and integrating dfj,{9o) . . . dfj,{9n), we get that 

» n 

j\7T{zk-zj)\'df,{9o)...d^i{9n)= Yl (-ir(-irn^-o>o) 

7r,o-GS„+i j=0 

(7.2) 

where Tk£ are the matrix elements of the (n + 1) x [n + 1) Toeplitz 
matrix. Now 

n n 
j=0 j=0 

and (— l)'^ = (—1)°"'^ \ Thus, summing over a for tt fixed and 
then over tt, we see the right side of ()7.2|) is {n+l)\Dn{dfi). Specializing 
to dfj, = e^^, we have proven 
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Theorem 7.1 (Coulomb Gas Representation for -D„). Let , L E 

Li(aD, f ). Then with Zk = e^^^ 

271 

j=0 



8. Extending the Strong Szego Theorem, Part II 

The Coulomb representation and Fatou's lemma are the tools for 
Johansson's half of the extension theorem. A measure of the form 

where L is a real Laurent polynomial (i.e., 'Yl^=-n^k€-^^^ with L_k = 

Lk) is called a GI measure (after its early use in [I^)- If dfi = 
with L G L^, we define the GI approximations, dfi(^N), by 

d^^^) = exp^ ^ L.e^'^^] ^ ^ exp(L(;v)(^^)) ^ (8.1) 

^\k\<N ^ 

Theorem 8.1 ([T7"). Suppose for any GI measure, dfi, we know that 



G(c//i)=exp( k\Lk\'^ ] 
^ fc=i ^ 



Then for any measure dn = with L E , we have 



G{dfi) <exp(^k\Lk\A (8.3) 
^ k=i ' 

Remark. In particular, if ^Y^=\ ^l-^fcP < oo, then G(d[i) < oo. 

Proof. Without loss (since we can multiply d^ by a constant), suppose 
1/0 = 0. Given dfx = e^^, let djj,(j\f) be its GI approximations and let 

q^),. = [(n+1)!]- / n |%-..reS-o^<-)(^^) n (8-4) 

0<i<A:<m j=0 

SO, of course, C(Ar),m = Dm{dfi(j\f)) by Theorem 17.11 Let Cm be the 
integral with L(7v) replaced by L. 

There is nothing to prove in ()8.3p if Yl'^^i k\Lk\'^ = oo, so suppose the 
sum is finite. Thus, L(jv) ^ L in L^, and so, we can find a subsequence 
L(jVj) — *■ L pointwise. By Fatou's lemma, for each m. 

Cm < liminf CmA,m (8.5) 
By Theorem lT.H (|2.16|) . Lq = 0, and the assumption for GI measures, 

C(Nj),m = Gm{dfi(Nj)) 
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< G{dlJ(^Nj)) 

= expf ^k\Lk\^ j 
^ k=i ' 

< exp( j 

^ k=i ' 

By dHUl) and U = 0, 

Gm{dn) = Cm< expf ^k\Lk\^\ 
^ fc=i 

Taking m — oo yields (j8.3|) . □ 

Theorems 16 . 1 1 and 18 . 1 1 reduce the proof of the sharp form of the strong 
Szego theorem to proving the resuh for BS and GI measures. In both 
cases, dfi = with e*^ t-^ L{9) analytic in a neighborhood of dD. In 
the last three sections, we will prove the strong Szego theorem in that 
case. 

9. The CD Formula 

If ui, U2 solve —u'- + Vuj = XjUj with Uj{0) = 0, then (Xi — X2)uiU2 = 
{uiu'2 — U2u[)', so 



(Ai — A2) / ui{x) U2{x) dx = Ui{a)u'2{a) — u'i{a)u2{a) (9.1) 
Jo 

The Christoffel-Darboux formula is just the analog of this Wronskian 
relation that many undergraduates learn! 

Theorem 9.1 (CD Formula). Let 

n 

i^„(z,C) = $^^¥^,(^) (9-2) 

j=0 

Then 



I .X V^n+l(C) <^n+l(^) - V?n+l(C) <^n+l(^) 
^n(2,C) = \ T (9.3) 

^ ^^<(C)-<^^<^n(^) .g 

1 - 
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Proof. By using (j3.22|) / (j3.23j) and their conjugates to write (Pn+i and 
(Pn+i in terms of and we get 



(9.5) 

where we used p„^(l — janp) = 1, and that the ip*ip cross terms cancel 
First of all, ()9.5j) says that ()9.3j) is equivalent to ()9.4|1 . In addition, 
if we let Q„(z,C) be what we know is the common value of the right 
sides of ()9.3p and ()9.4|) . then subtracting ()9.3p for n — 1 from ()9.4p for 
n, we see that 



g„(^,C)-Qn-i(^,C) = (!-<) 



V5n(C)V5n(2;) 



SO we need only prove Qo{z,() = Ko{z,()- Since ()9.4|) for n = is 1 
and that is Ko{z, (), ()9.3|) is proven. □ 



Corollary 9.2. For e*^ G 9©, 



ie\\2 



e\\2 



r=l 



(9.6) 



Proof. In (j9.3p . take z = ( = re*^ and take r | 1. If we note 

, we get two terms: the one from 



\(Pn+i[re' 



,2n+2 



the -(r2'^+2 _ _ gi^gg ^j^g ^ l)|(^;^^(e^^)|2 term in (j^ . 

and the other terms give the derivative in ()9.(ij) . □ 

The CD formula for OPUC is due to Szego 32j. Our proof is taken 
from Golinskii [T^ . 

Remark. Integrating both sides of ()9.(jj) yields 

d , . 



dr 



ie\\2 



(9.7) 



r=l 



10. The Feynman-Hellman Theorem for Toeplitz 

Determinants 

Here we want to begin with a simple but useful formula for 
^||$„||^^^, where the measure dfi\ depends smoothly on A. This ap- 
peared in j2Hl, strongly motivated by closely related ideas of Baik et 
al. 0. 
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Theorem 10.1. Let dfix be a family of positive measures on which 
are in A in the sense that A h-* Cn{dfix) is for each n. Then 
\\^n{- ,dfix)\\^,^ is and 



^log||$„| 

Proof. Since ^log||$„|p = 
d 



Ma 



d\ 



\^n{e'\d^^x){' 



(10.1) 



;^||$„|p)/||$n|P, it suffices to prove that 



Since 



d\ " ^ 



IMA 



j \^n{e"\d^i^ 



dX 



<l>n{e'^,dfix)'^n{e',dfix)dn 



(10.2) 



;io.3) 



its derivative is a sum of three terms, of which one term is the right 
side of ()10.2|1 and the other two are 



d_ 



<l>„(e*^d/iA)$n(e'^rf/iA)rf/i 



(10.4) 



and its conjugate. 

But, for all A, is monic, so is a polynomial of degree at 

most n — 1 and so orthogonal to $„. It follows that the term in p().4j) 
is zero. □ 

Remark. Those familiar with the Feynman-Hellman theorem 
will see a metaphysical link to this proof. 

We want to do two things with po.lj) . First, we sum several logs 
in \ogDn{dfi) = X]j=o ll'^j IP (by ()2.4p ). and second, we make an 
explicit choice of dfiy- 

Theorem 10.2. Let Wt be a family of L^ functions, in t so \ogWt 
is also in t, and Wq = 1. Then 



log Dn Wi 



2^ 



(n + i)iog(||$„+i||?=i; 



dt 



d_ 
dt 



{log Wt] 



d_ 

dr 



d^t{6) 



r=l/ 

10.5) 



Proof. We have that 

d dO w 
— Wt — = — 
dt 271 Wt 



Wt— = — dfit = ^ (log Wt) d^t 



d 
di 
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Using this, (jTlD, (ITinil . and the definition (0 of K, we get 



d 
di 



log Dn Wt 



d9 
2^ 



d 
'di 



(log Wt) 



K{e'\ e''- dfit) dfit (10.6) 



Now use fl9.6|) to get two terms. If we integrate dt, the first term gives 
the integral in ()10.5p . The second term can be integrated using pO.H) 
to give the first term in (jlU.Sp . □ 

11. Completion of the Proof 

We complete the proof, following by showing: 

Theorem 11.1. Let d^{0) = e^(^)£ where e*^ h-» L{e) is analyt 
neighborhood ofd3. Then 

oo 

lim [logD„(d/i) -{n + l)Lo] = V fc|Lfep 



%c in a 



;ii.i) 



k=l 



Proof. Without loss, we suppose j dfi{6) = 1. We claim first that 
(under the analyticity assumption) 



lim (n + l)[log||<l>„+if -Lo] 

n— >oo 

For in terms of the Verblunsky coefficients. 



;ii.2) 



1$ 



n.+l I 



j=0 



— a,- 



(by (j2lIH)), while dUI} and (jTrijl say that 



j=0 



— a, 



Thus 



I'^'n+lllV^" 



2\-l 



n (i-K-n 

j=n+l 

= 1 + 0(e-^"/') 

by (jOTjl . This proves (flT^ . 

Thus, by (|1(J.5|) . (|11.1|) is equivalent to there being a choice Wt with 



- / dt 



log Wt 



d_ 

dr 



<+i(re^';rfyUt) 



r=l 



d/ii(^)-.5^A;|4|2 
11.3) 



k=l 
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Define for < t < 1, 



Cf = log 



tL{e) ^ 
2n 



and 



wt{9) = exp(tL(^) - ct) 
so J (i/it = 1 also. Thus, the Szego function for fj,t is given by 



log Dt{z) =t\ogD{z) 



■Ct 



11.4) 

;ii.5) 

;ii.6) 
;ii.7) 



fc=i 



It follows that Dt{z) is analytic in a t-independent disk, and then 
the Verblunsky coefficients obey 

|«„(rf/it)| <Ce-'^"/2 

uniformly in t G [0, 1], and then, from the proof of Theorem 15.41 that 
(2;, (i/Xf) — i> Dt{z)~^ uniformly in t G [0, 1] and 2; in | \z\ < e^^^}. 

Thus, the ^ derivative in the integral on the left side of (jll.Hj) converges 

to a ^ derivative of Dt. 
We also write 



d 

2K+ipRe — log(¥?;+i 



-2|Ar'Re( t-?- logD 



dr 



and we can use \Dt\ dfit = 



By 



i^ogwt = m-^^ct 



11.8 



;il.9) 



Since the J^q is 6'-independent and by using ()9.7p . we see the can 
be integrated to ci — cq = 0. Thus, in ()11.3|) we can replace -j^log Wt 
by L{0). The result is that 



LHS of (ITT3|1 



dt / 2tL{e) Re 



d_ 

dr 



\ogD 



de 
2^ 



;ii.io) 



The only t-dependence is the 2t and 2tdt = 1 , so 

^ L^e^"' Re( ^fcLfce^'^M £ 

-fc=-oo -I ^ fc=l ^ 

« / 00 \ p 00 



fc=— 00 



fc=— 00 



2^ 
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oo _ oo 

A;=— oo 1 

as claimed. □ 
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